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TESTS  FOR  THE  GAMMA  DISTRIBUTION  WITH 


ESTIMATED  SHAPE  PARAMETERS 
By 

R.A.  Lockhart  and  M.A.  Stephens 


Summary. 


It  is  well  known  that  the  distributions  of  the  goodness-of-f it 
2  2  2 

statistics  W  ,  U  ,  and  A  do  not  depend  on  location  and  scale 
parameters  even  when  these  parameters  must  be  estimated.  It  has 
generally  been  assumed,  however,  that  when  shape  parameters  must 
be  estimated  the  null  distributions  of  these  statistics  would  depend 
too  severely  on  the  unknown  parameters  to  permit  their  use  in  practice. 
Exact  asymptotic  distributions  of  these  statistics  are  presented  for 
the  two  parameter  gamma  family  of  distributions.  Critical  points  are 
given  in  tables  constructed  so  as  to  avoid  any  need  for  extrapolation. 
The  results  presented  show  that  the  tests  can  be  expected  to  be  useful 
in  practice. 
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Introduction. 


Goodness-of-f it  tests  such  as  the  Craraer-Von  Mises  statistic  W 

are  distribution  free  when  the  distribution  being  tested  is  completely 

specified.  When  parameters  must  be  estimated,  as  when  testing  for  the 

normal  family,  the  situation  is  not  so  nice.  If  the  parameters  are  those 

o 

of  location  and  scale  the  null  distribution  of  W“  will  depend  on  the 
particular  family  being  tested  but  not  on  the  exact  values  of  the  para¬ 
meters  being  estimated.  When  a  shape  parameter  must  be  estimated,  as  in 
the  gamma  family,  the  distribution  of  the  test  statistic  will  generally 
depend,  even  asymptotically,  on  the  unknown  true  value  of  the  shape 
parameter.  In  this  paper  we  explore  the  extent  of  the  dependence  of 
critical  points  on  the  shape  parameter  in  the  two-parameter  gamma  family. 

Our  major  conclusion  is  that  the  dependence  is  small  enough  to  make  the 
tests  useful  in  practice. 

In  this  paper  we  discuss  the  behavior  of  the  goodness-of-f it  statistics 
2  2  2 

W  ,  U  ,  and  A  for  testing  the  hypothesis  H  :  that  a  random  sample  of 

o 

size  n  comes  from  the  gamma  distribution  with  density 

(1.1)  f  (>:;a,S)  =  T  (a)  ^3  axa  '*"exp(-x/8)  0  <  x  ■'  *  . 

Following  Stephens  (1976)  we  distinguish  four  situations:  Case  0,  where 
both  ol  and  o  arc  known;  Case  1,  where  3  is  known  but  a  must  be 
estimated;  Case  2,  where  a  is  known  but  S  must  be  estimated  and  Case  3, 


where  both  parameters  are  unknown  and  must  be  estimated.  Case  2  has  been 
studied  by  Pettitt  and  Stephens  (1975).  In  this  paper  we  concentrate  on 
Case  3  but  include  the  critical  points  for  cases  1  and  2  for  che  sake  of 
completeness.  Those  for  Case  2  have  been  recomputed. 

In  section  2  we  review  briefly  the  relevant  theory  for  Case  3  in 
order  to  fix  notation  and  to  record  the  relevant  formulae  for  the  gamma 
distribution.  In  section  3  we  discuss  the  problem  of  computing  the 
asymptotic  critical  points  of  the  statistics  and  tabulate  the  results. 

An  important  feature  of  these  tables  is  that  there  is  never  any  need  for 
extrapolation — only  interpolation.  Section  3  contains  the  relevant  theory 
as  well  as  some  useful  new  identities  for  computing  asymptotic  moments  of 
the  statistics.  Some  general  conclusions  and  speculations  are  in  section  4. 


Suppose  X^,X2» . . • ,Xn  are  a  sample  from  the  gamma  distribution  (1.1) 
with  distribution  function  F(x,9)  where  9  =  (a,g)  is  in  (0,°°)  x  (0,°°)  . 
Let  be  the  empirical  distribution  function.  Let  9  =  (a,8)  be  the 

maximum  likelihood  estimates  of  (ot, 8) — see  Johnson  and  Kotz  (1970,  page 
187).  The  Cramer-Von  Mises  statistic  is  defined  by 


(2.1) 


W2  =  n 


(F  (x)-F(x,8)}  F(dx,9) 
n 


(See  Stephens  (1977)  for  compui_ir.g  formulae.)  The  asymptotic  theory 
reproduced  here  is  now  standard — see  for  example  Stephens  (1976). 


2 


Asymptotically  (2.1)  is  equivalent  to 


(2.2) 

where 


n 


,oo 


J  —00 


{Fn(x)-F(x,0)}2F(dx,e) 


Yn(t)dt 
0  n 


(2.3)  Y  (t)  =  n^EF  {F‘1(t)}-FfF‘1(t),0)]  . 

n  no  u 

Here  F  ^(t)  is  the  inverse  of  the  map  x  F(x,0). 

The  process  converges  weakly  in  D[0,1]  to  a  mean  zero  Gaussian 

process  Y(t).  The  covariance  of  Y  is  given  by 


(2.4)  p(s,t)  =  p0(s,t)  -  <j>(s)T$<i(t) 

where  Pq(s,0  =  min(s,t)-st  is  the  covariance  of  the  Brownian  bridge 

2 

appropriate  for  W  in  Case  0,  $  is  the  inverse  of  the  Fisher  informa- 

3  -i 

tion  matrix  and  <Ks)  =  -rr-  F(x,0)  evaluated  at  x  =  F.  (s).  For  the 

o  y  y 

gamma  distribution  in  Case  3  we  have 


(2.5) 


6(s)  = 


F(a)  ^  ^  ^  e  Uua  Hnudu-s'l'(ci) 


-r<a)-V«-<0,+1) 


-x/S 


and 


3 


(2.6) 


ac(a) 

-c(a) 

-c  (a) 

c(a)4)'  (a) 

where  tKa)  =  ln(r(a))  is  the  digamma  function,  ip  *  (a.)  is  its  deriva¬ 
tive,  the  trigamma  function,  and  c(a)  =  (a4>'(a)-l)  \  (See  Abramowitz  and 
Stegen  (1965)  for  a  discussion  of  these  functions  and  useful  recurrence 
relations. ) 

2 

For  A  the  situation  is  similar  except  that  Y^  must  be  divided 

_  2 

by  w(s)  =  /s(l-s)  and  p  must  be  divided  by  w(s)w(t).  For  U  ,  Y 

n 

is  replaced  by  Y^(t)  -  Yn(s)ds  and  p(s,t)  is  replaced  by 


pQ(s,t)-(<?(s)  - 


<t>  (u)du)  $(<t>(t)  - 


i  (u)du) 


2  2 

where  now  Pg(s,t)  =  min(s,t)  -  st  +  1/12  -  (s-s  +t-t  )/2. 

For  all  three  statistics  then  the  asymptotic  distribution  is  that  of 

1  2 

;Q  Y  (t)dt  where  Y  is  a  mean  zero  Gaussian  process  with  covariance  p 

T 

of  the  form  p^ (s ,  t )  — (s)  ^<f(t)  where  Pq  is  independent  of  0.  This 

°o 

distribution  is  that  of  Vi'^i  w^ere  t^e  are  independent  and 

identically  distributed  chi-square  random  variables  on  one  degree  of 
freedom  and  0  <  ^  ^  —  * ’ *  are  eigenvalues  of  the  integral 

equation 


(2.7) 


h(s) 


=  X 


r  1 

h(t)p(s,t)dt  . 

J0 
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In  Case  0  the  eigenvalues  and  eigenfunctions  can  be  found  explicitly  for 
the  three  statistics  studied  here  (again  the  reader  is  referred  to  Stephens 


(1976)  for  details  and  history).  We  will  denote  the  Case  0  eigenvalues  by 

*  * 

and  call  them  the  standard  weights.  We  use  f.  to  denote  the  corres- 
i  l 

ponding  orthonormal  eigenf untions . 

* 

Now  expand  (2.4)  in  terms  of  f^.  Call 


(2.8) 


a .  = 
l 


b(s)f i(s)ds  . 


The  eigenvalues  of  (2.7)  are  then  the  roots  of 

(2.9)  D(X)  =  0 


where  D  is  the  Fredholm  determinant  of  (2.7)  given  by 

(2.10)  D(\)  =  D0(A)Det(I  +  l  a^J/ (1-X/X±) ) 

i=l 

with  Dq  the  Case  0  Fredholm  determinant  and  I  the  2x2  identity  matrix. 

3 .  Computation  of  Asymptotic  Critical  Points. 

The  process  of  calculating  the  asymptotic  critical  points  of  these 
statistics  is  long  and  will  be  described  step-by-step.  Since  all  the 
statistics  are  scale  invariant  the  results  do  not  depend  on  6.  Accordingly, 
we  use  8=1  for  the  remainder  of  the  paper. 

Step  1:  Fourier  coefficients. 

The  first  step  is  to  calculate  the  Fourier  coefficients  a^  with  (2.8). 
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2  *  r~ 

For  W  che  eigenfunctions  are  f  (s)  =  v2  sin(iTjs).  With  p(s)  as 

recorded  in  (2.5)  the  resulting  formulae  involve  double  integrals  but 

integration  by  parts  produces  the  following  simpler  formula: 


(3.1) 


a . 
j 


/2  (irjr(a)) 


,00 

cos  (irjF(x,ct,  1) )  e  Xxa  ^ 

J0 


In  x  dx 


~/2  (irjT(a)) 


►OO 

cos(TTjF(x,a}l))e  Xx&dx 

J0 


For  U  the  eigenfunctions  are 


f 2 j  (s)  =  ^2  sin(2irjs) 

and 

f 2j_i (s)  =  */2  cos(2iTjs)  . 


2  2 

Hence  a^  for  U  is  given  by  a for  W  while  after  some  integration 
by  parts  we  obtain 


(3.2)  a2j_1 


-2  1/2(^jr(a))  1 


—x  ct— 1 

sin(2TTjF(x,a,l))e  x  In  x  dx 


/2  T(a) 


-2 


-2x  2a-l, 

cos(2irjF(x,a,l))e  x  dx 


For  A  the  i  eigenfunction  is  the  i  Ferrar  associated  Legendre  function 
(see  Anderson  and  Darling,  1952).  After  some  algebra  and  integration  by  parts 
we  obtain 
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where  c..  =  [  (2j+l)  /  (j  (j  +-1) )  ]  and  p^_  is  the  j Legendre  polynomial 

(see  Abramowitz  and  Stegun  1965,  page  332). 

It  was  judged  necessary  for  the  purposes  of  Step  2  to  obtain  numerical 

values  of  the  integrals  (3.1-3)  for  j  from  1  to  50  and  for  a  variety  of 

shapes  a.  Thus  for  each  value  of  the  shape  there  were  300  numerical 

integrals  to  be  done.  Many  of  the  integrands  involve  similar  functions. 

2 

Moreover,  the  eigenfunctions  for  large  j  may  (and  for  A  must)  be 
evaulated  by  recursion  from  those  for  small  j .  In  order  to  take  advantage 
of  these  properties  we  wrote  a  routine  to  do  Romberg  extrapolation  (see 
Ralston  (1965,  page  121))  of  the  trapezoidal  rule  for  an  array  of  functions. 
For  integer  values  of  a  the  gamma  cumulative  F(x,a,l)  can  be  computed 
explicitly.  We  therefore  computed  the  six  integrals  for  a  from  1  to  20 
and  for  j  from  1  to  50  simultaneously.  The  result  was  an  important  saving 
in  computer  time — the  integrals  still  required  around  10  hours  of  CPU  time 
on  an  IBM  4341. 


Step  2:  Roots  of  the  Fredholm  Determinant. 

The  next  step  is  to  find  the  roots  of  D(A)  =  0.  In  all  cases  the  poles 
in  the  determinant  term  in  D  cancel  the  zeros  in  so  it  suffices  to 

00  Tj,  * 

search  for  the  rocts  of  the  determinant  term.  The  series  E.  .  a. a.  $/( 1-A/X.) 

1=1  1  1  J 

must  be  truncated  somewhere.  We  found  by  experiment  that  400  terms  are  adequate. 


The  difficulty  is  that  we  were  able  to  compute  only  50  aj's'  Provec* 
possible,  however,  to  extrapolate  the  aj's  with  confidence  since  the  aj's 
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were  seen  to  fall  away  smoothly  with  j.  We  therefore  carried  out  the 


We  then  searched 


—  j- 

extrapolation  from  50  to  400  by  fitting  a.  =  k  j 

•k  k 

each  interval  X  <  \  <  X^+^  exhaustively  stopping  when  about  200  roots 
were  found. 


Step  3:  Critical  Values. 

oo 

The  final  step  is  to  compute  the  critical  points  of  Q  =  E.  ,  V./X., 

3=1  3  3 

We  used  two  different  methods.  First,  Pearson  curves  were  fitted  using 
the  first  four  cumulates  of  Q.  Second,  the  Imhof  technique  (see  Imhof, 
1961)  of  numerically  inverting  the  characteristic  function  of  Q  was 
used . 

The  cumulants  needed  for  use  with  Pearson  curves  were  found  several 
ttl 

ways.  First,  the  r  cumulant  is  given  by  (Stephens,  1976): 


(3.4) 


<  =  2r_1(r-l):  l  X“r 

3=1  J 


This  sum  converges  quickly  for  r  >  2  but  for  r  =  1  is  unsatisfactory. 

The  sum  of  those  terms  truncated  away  in  a  finite  approximation  may, 

2 

however,  be  calculated  approximately  using  the  fact  that  j  X^  has  a 
computable  limit. 

A  second  formula  for  is  (Stephens,  1976) 


(3.5) 


< 

r 


f1 

^0 


p  (s , s)ds 


where  p,  =  p  and  p  (s,t)  =  /_  p  (s  ,u)o  (u,  t)du.  For  r=l  and  integral 
1  r+l  U  r 

values  of  a  these  integrals  are  tractable  but  very  messy.  Lengthy 
recurrence  relations  were  developed  (details  may  be  obtained  from  the 
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authors)  for  these  calculations,  reducing  the  calculations  to  gamma, 

2  2 

digamma  and  trigamma  function  evaluations — at  least  for  W  and  U  . 

2 

For  non- integral  a  and  A  even  numerical  integration  seemed  formidable. 

A  third,  r.nd  seemingly  new,  approach  to  the  calculation  of  <r  is 

as  rollows.  Expand  the  integrals  in  (3.5)  in  terms  of  the  eigenfunctions 
* 


f . 
3 


of 


0  * 


The  result  is  the  following: 


o(s,s)ds  = 


0 

r  1 

I 

I 

0 

1 


r  1 


(1 


(s,s)ds  -  l 


3=1 


1  J 


P  (s,s)ds  =  j  Pn  „(s,s)ds  -  2  l  a^a  /X  +  l 

J  J  > 


P^(s,s)ds  = 


Pn  _(s,s)ds-3  £  aT$a./(X  ,)2  + 3  £  aTfa  a.^a./X, 

0  j  J  3  J  3  V  3  J 


(3.6) 


l . 


P^(s,s)ds  = 


j,k,^ 
T 


,  *\  3 


Pn  , (s,s)ds  -  4  l  a  fa  /(X  ) 
U  ^  •  J  J  J 


+  2  1  aT$a.  a^a./X  .X  +  4  5!  a^a,  a^a  / (X  ) 

.  ,  j  k.  k  i  j  k  .  ,  j  k  kT  i  k 
l,k  J  J,k 


-  4  .  J  ,  aj*VktaiaI*aj/i* 

j  ,k,  x.  j  j 


+  l 


j ,k,2,ra 


aT$a  a^|a  a^a  aT$a.  . 
2  k  k  x.  I  m  nr  j 


These  series  have  the  advantage  of  converging  very  quickly  even  for  the 

2 

mean — that  is  r  =  l.  For  non-integral  shapes  and  for  A  these  calculations 
were  the  only  ones  available  for  the  mean.  For  integral  a  and  Case  2 
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where  the  integrals  can  be  done  explicitly  for  r  =  1  (see  Pettitt  and 
Stephens,  1975),  the  series  (3.6)  agreed  very  well  with  the  results  of 
direct  integration  (differing  only  in  the  seventh  or  eighth  significant 
digit).  These  series  also  can  be  used  to  check  the  accuracy  of  the  computa¬ 
tion  of  the  roots  of  (2.9)  since  they  do  not  use  the  computed  values  of  the 
eigenvalues  X  .  Comparison  of  (3.6)  and  (3.4)  suggests  that  Step  2,  the 
calculation  of  the  roots  of  the  Fredholm  determinant,  has  been  performed 
very  accurately.  Finally  it  should  be  remarked  that  the  double,  triple 
and  even  quadruple  sums  in  (3.6)  can  all  be  written  as  products  of  singly 
infinite  sums. 

The  critical  points  were  also  found  using  the  technique  of  Imhof  (1961) 

of  numerical  inversion  of  the  characteristic  function  of  a  weighted  sum 

00 

of  chi-squares.  The  series  l  .  .  V./X.  was  truncated  at  40  terms  and 

J=1  J  J 

the  tail  of  the  sum  replaced  by  its  expected  value.  Since  the  standard 
deviation  of  the  tail  is  small  compared  to  its  mean,  this  approximation 
was  expected  to  perform  well.  The  approximation  was  checked  in  several 
cases  by  using  150  terms  of  the  sum  and  was  found  to  be  excellent. 

The  Pearson  curve  and  Imhof  calculations  were  found  to  agree  well, 
rarely  differing  by  more  than  1  in  the  third  significant  digit.  In  tables 
1,  2,  and  3  we  report  the  Imhof  calculations  as  these  are  based  on  the 
exact  l-  ymptotic  distribution.  For  the  sake  of  reference  we  record  the 
cumulants  of  Q  in  tables  4,  5,  and  6.  Case  1  is  in  tables  1  and  4,  while 
Case  2  is  in  tables  2  and  5  and  Case  3  is  in  tables  3  and  '. 

The  test  is  now  carried  out  as  follows: 

1)  Estimate  whichever  parameters  are  unknown  by  maximum 
likelihood. 
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2)  Order  the  sample  data  points  X,,,  <  X,--.  <  •■•  <X,  ,  and 

(.1;  {/.)  (.n; 

calculate  =  F(X^^,a,B)  where  any  parameter  which 

has  been  estimated  is  replaced  by  that  estimate. 

2  2  2 

3)  Calculate  W  ,  U  ,  or  A  using  the  computing  formulae  of 
Stephens  (1977). 

4)  Enter  the  appropriate  table  using  a,  or  when  this  is 
estimated  its  estimate  a,  interpolating  when  necessary 
to  find  the  appropriate  critical  point.  Linear  interpola¬ 
tion  in  a  is  adequate  except  betw  en  a  =  20  and  a  =  <=° 
where  we  recommend  linear  interpolation  in  1/a. 

The  tests  are  asymptotically  exact  because  a  is  consistent.  In 

finite  samples,  however,  entering  the  tables  using  a  seems  risky  since 

one  will  normally  be  taking  the  critical  point  from  the  wrong  row.  In 

the  present  case,  however,  the  critical  points  are  seen  to  depend  very 

little  on  a,  particularly  for  a  bigger  than  2  or  so.  In  the  case  of 
2 

U  the  dependence  is  only  one  digit  in  the  third  significant  digit.  For  a 
less  than  1  the  critical  points  change  rapidly  with  a  but  it  should 
be  noted  that  the  standard  error  of  a  is  small  for  a  near  0 — partially 
compensating  for  this  defect. 

The  second  source  of  error  in  using  the  tables  is,  of  course,  the  use 
of  asymptotic  theory  in  finite  samples.  A  Monte  Carlo  study  by  Schneider 
and  Clickner  (1977)  shows  the  critical  points  even  for  samples  of  size  20 
are  in  excellent  agreement  with  the  asymptotic  points  at  least  for  shapes 
over  0.5.  For  samples  of  size  10  their  Monte  Carlo  points  are  still 
generally  within  2  or  3  percent  of  our  asymptotic  points.  Schneider  and 
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Clickner  have  studied  the  problem  of  entering  the  tables  using  estimated 
shapes  and  report  levels  of  4  to  6  percent  when  testing  at  a  nominal  5 
percent  level  with  the  true  levels  rising  to  7  or  8  percent  for  a  shape 
near  0  and  sample  sizes  of  20.  Schneider  and  Clickner  had  no  points  for 
a  shape  of  0.  When  they  encountered  estimated  shapes  smaller  than  the 
least  value  of  a  for  which  they  had  Monte  Carlo  points  they  were  forced 
to  use  some  sort  of  extrapolation.  This  may  have  led  to  erroneous  readings 
of  the  true  level  of  the  test.  We  plan  our  own  Monte  Carlo  studies  on 
this  point. 

Our  tables  list  two  shape  parameter  values  outside  the  parameter 
space — namely,  a  =  0,  a  =  «>.  These  points  are  provided  to  eliminate  the 
need  for  extrapolation  beyond  the  ends  of  the  tables.  They  were  computed 
using  the  following  theorem: 

Theorem  3.1:  Suppose  p^Cs.t)  is  a  sequence  of  non-negative  definite 
covariance  functions  of  mean  zero  Gaussian  process  { ( t ) ;  0  <_  tjf_l}. 

Suppose  that,  as  n  tends  to  infinity,  converges,  uniformly  in  s 

and  t,  to  p(s,t),  the  covariance  function  of  a  mean  zero  Gaussian 
process  {Y(t);  0_<  t<_l}.  Then  Y^  converges  weakly  to  Y  in  C  [  0 , 1  ] . 

1  2 

Hence,  under  the  conditions  of  the  theorem,  Yn(t)dt  converges  in 
1  2 

distribution  to  /  Y  (t)dt. 

2 

As  a  -*■  0  we  find,  for  W  ,  that  p(s,t)  converges  to 

(3.7)  pQ(s, t)-sln(s)tln(t)  . 

2  2  2 
For  A  the  limit  is  that  for  W  divided  by  w(s)w(t).  For  U  the  limit 

is  pp(s,t)  -  (sln(s)+l/4) (tln(t)+l/4)  where  now  p^  is  the  Case  0  covariance 
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for  U  .  We  remark  that  these  covariances  are  those  encountered  in  the 
one-parameter  exponential  distribution — see  Stephens  (1976)  . 

As  a  ->  »  the  limit  is  somewhat  harder  to  compute,  but  a  Cornish- 
Fisher  expansion  shows  that  the  covariances  converge  to  those  of  Case  3 
for  the  normal  distribution  as  defined  in  Stephens  (1976).  It  should  be 
noted  that  as  a  <*>  the  gamma  density  becomes  more  and  more  normal. 
This  suggests,  but  does  not  prove,  the  conclusion  just  reached  concerning 
the  critical  points. 

Although  critical  points  are  reported  in  Stephens  (1976)  for  Case  3 
of  the  normal  distribution  using  Pearson  curves  we  have  recomputed  the 
points  using  Imhof's  technique.  We  report  these  new  computations  here. 

4.  Conclusions. 

The  tests  studied  will  be  useful  in  practice.  The  convergence  of 
the  finite  sample  points  to  the  asymptotic  points  is  rapid.  In  addition 
the  asymptotic  points  depend  little  on  the  shape  for  shapes  larger  than 
1  or  2.  For  small  samples  and  estimated  shapes  near  0  the  asymptotic 
points  are  slightly  larger  than  the  finite  sample  points.  Since  the 
effect  of  entering  the  tables  with  an  estimated  shape  appears  to  be  an 
increase  in  the  frequency  of  rejection  use  of  the  asymptotic  points  even 
in  small  samples  is  to  be  recommended. 
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TABLE  1 


PERCENTAGE  POINTS  FOR  THE  NULL  DISTRIBUTIONS  OF  THE  TEST 
STATISTICS  W2,  U2,  AND  A2  IN  CASE  1 
KNOWN  SCALE 


Statistic  Shape  Significance  Level 

Parameter 


0.25 

0.10 

0.05 

0.025 

0.01 

0.005 

0 

.115 

.174 

.221 

.269 

.341 

.395 

0.2 

.113 

.169 

.214 

.260 

.322 

.368 

0.5 

.108 

.160 

.200 

.243 

.299 

.346 

2  1 

.103 

.150 

.186 

.223 

.273 

.311 

W2  2 

.0988 

.143 

.176 

.210 

.256 

.291 

5 

.0958 

.138 

.169 

.202 

.245 

.278 

10 

.0948 

.136 

.167 

.199 

.241 

.274 

20 

.0943 

.135 

.166 

.198 

.240 

.272 

00 

.0938 

.134 

.165 

.196 

.238 

.270 

0 

.0903 

.129 

.158 

.191 

.234 

.256 

0.2 

.0908 

.130 

.159 

.192 

.235 

.257 

0.5 

.0908 

.130 

.159 

.192 

.235 

.257 

1 

.0899 

.129 

.159 

.189 

.230 

.261 

2 

.0894 

.128 

.158 

.189 

.229 

.261 

5 

.0891 

.128 

.158 

.188 

.229 

.260 

10 

.0889 

.128 

.157 

.188 

.228 

.260 

20 

.0888 

.127 

.157 

.187 

.228 

.260 

oo 

.0888 

.127 

.157 

.187 

.228 

.259 

0 

.735 

1.06 

1.32 

1.59 

1.96 

2.24 

0.2 

.725 

1.04 

1.29 

1 . 55 

1.90 

2.17 

0.5 

.703 

.997 

1.22 

1.46 

1.78 

2.03 

2 

1 

.630 

.956 

1.17 

1.39 

1.69 

1.92 

A  ^ 

*» 

dm 

.661 

.926 

1.13 

1.34 

1.62 

1.34 

c 

.649 

.906 

1.13 

1.30 

1.59 

1.79 

10 

.645 

.899 

1.09 

1.29 

1.56 

1.77 

20 

.643 

.396 

1.09 

1.29 

1.56 

1.76 

CO 

.641 

.8^3 

1 .  09 

1.28 

1.55 

1.75 

Remarks : 

1)  Points 

for  shape  0 

are  from 

.  the 

one  parameter  exponential 

family — Case  4  of  Stephens  (1976). 

2)  Por'nts  for  shape  »  are  from  Case  1  of  the  normal  family — 
see  Stephens  (1976). 
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TABLE  2 


PERCENTAGE  POINTS  FOR  THE  NULL  DISTRIBUTIONS  OF  THE  TEST 
STATISTICS  W2,  U2 ,  AND  A2  IN  CASE  2 
KNOWN  SHAPE 


Statistic  Shape  Significance  Level 

Parameter 


0.25 

0.10 

0.05 

0.025 

0.01 

0.005 

0 

.209 

.  340 

.461 

.580 

.745 

.867 

0.2 

.150 

.257 

.339 

.423 

.542 

.629 

0.5 

.132 

.205 

.265 

.328 

.412 

.484 

1 

.116 

.174 

.222 

.270 

.337 

.399 

W 2  2 

.106 

.155 

.104 

.235 

.289 

.332 

5 

.0987 

.143 

.177 

.212 

.259 

.294 

10 

.0963 

.139 

.171 

.204 

.248 

.282 

20 

.0951 

.137 

.168 

.200 

.243 

.276 

oo 

.0938 

.134 

.165 

.196 

.238 

.270 

0 

.105 

.151 

.188 

.221 

.265 

.303 

0.2 

.0950 

.137 

.166 

.199 

.244 

.266 

0.5 

.0916 

.131 

.160 

.193 

.237 

.258 

1 

.0897 

.129 

.159 

.189 

.230 

.261 

2 

.0892 

.128 

.158 

.188 

.229 

.260 

5 

.0889 

.128 

.157 

.188 

.228 

.260 

10 

.0889 

.127 

.157 

.187 

.228 

.260 

20 

.0888 

.127 

.157 

.187 

.228 

.259 

OO 

.0888 

.127 

.157 

.187 

.228 

.259 

0 

1.25 

1.93 

2.49 

3.08 

3.88 

4.50 

0.2 

0.923 

1.41 

1.80 

2.22 

2.78 

3.22 

0.5 

0.803 

1.19 

1.50 

1.82 

2.27 

2.61 

1 

0.735 

1.06 

1.32 

1.59 

1.96 

2.24 

A2 

2 

0.692 

0.982 

1.21 

1.44 

1.76 

2.01 

5 

0.662 

0.929 

1.14 

1.35 

1.64 

1.86 

10 

0.652 

0.911 

1.11 

1.32 

1.59 

1.81 

20 

0.646 

0.902 

1.10 

1.30 

1.57 

1.78 

So 

0.641 

0.893 

1.09 

1.28 

1.55 

1.75 

lemarks : 

1)  Points 

for  shape  0 

are  the 

Case 

0  oints. 

2)  Points  for  the  shape  00  are  the  Case  1  points  of  the  normal 
distribution — see  Stephens  (1976). 
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TABLE  3 


PERCENTAGE  POINTS  FOR  THE  NULL  DISTRIBUTIONS  OF  THE  TEST 
STATISTICS  W2,  U2,  AND  A2  IN  CASE  3 


Statistic  Shape  Significance  Level 

Parameter 


0.25 

0.10 

0.05 

0.025 

0.01 

0.005 

0 

.115 

.174 

.221 

.269 

.341 

.395 

0.2 

.0924 

.136 

.168 

.204 

.252 

.296 

0.5 

.0823 

.119 

.147 

.175 

.214 

.244 

2  1 

.0785 

•  111 

.136 

.162 

.196 

.222 

W  2 

.0762 

.107 

.131 

.155 

.187 

.211 

5 

.0748 

.105 

.128 

.151 

.182 

.206 

10 

.0744 

.104 

.127 

.150 

.180 

.204 

20 

.0741 

.104 

.126 

.149 

.179 

.203  ' 

CO 

.0739 

.104 

.126 

.149 

.179 

.202 

0 

.0903 

.129 

.158 

.191 

.234 

.256 

0.2 

.0896 

.128 

.156 

.189 

.228 

.254 

0.5 

.0719 

.101 

.122 

.144 

.174 

.196 

2  1 

.0705 

.0984 

.119 

.141 

.169 

.190 

U2  2 

.0699 

.0973 

.118 

.139 

.166 

.187 

5 

.0695 

.0966 

.117 

.138 

.165 

.186 

10 

.0693 

.0964 

.117 

.137 

.164 

.185 

20 

.0692 

.0963 

.117 

.137 

.164 

.185 

GO 

.0692 

.0962 

.117 

.137 

.164 

.184 

0 

.735 

1.06 

1.32 

1.59 

1.96 

2.24 

0.2 

.543 

.756 

.923 

1.09 

1.33 

1.51 

0.5 

.502 

.685 

.825 

.967 

1.16 

1.31 

2  1 

.486 

.657 

.786 

.917 

1.09 

1.23 

A2  2 

.477 

.643 

.768 

.894 

1.06 

1.19 

5 

.472 

.635 

.758 

.881 

1.04 

1.17 

10 

.471 

.633 

.755 

.877 

1.04 

1.16 

20 

.470 

.632 

.753 

.875 

1.04 

1.16 

oo 

.469 

.631 

.752 

.873 

1.03 

1.16 

Remarks:  1)  Points  for  shape  0  are  from  the  one  parameter  exponential 

distribution — see  Stephens  (1976). 

2)  Points  for  shape  are  from  the  normal  distribution  for 
Case  3 — see  Stephens  (1976). 
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TABLE  4 


FIRST  FOUR  CUMULANTS  OF  THE  STATISTICS 
CASE  1—  KNOWN  SCALE 


3 

u 

3 

x  10 

k,  x  10 

Shape 

0 

Mean  x  100 

Variance  x  10 

3 

4 

9.26 

4.36 

6.39 

1.54 

0.2 

7.58 

4.03 

5.54 

1.25 

0.5 

8.66 

3.38 

4.05 

0.796 

1 

8.22 

2.84 

2.96 

0.507 

2 

7.87 

2.48 

2.35 

0.362 

5 

7.64 

2.27 

2.02 

0.291 

10 

7.56 

2.20 

1.92 

0.271 

20 

7.52 

2.17 

1.87 

0.262 

00 

7.48 

2.14 

1.83 

0.253 

U2 

k.  x  105 

k.  x  10^ 

Shape 

Mean  x  100 

Variance  x  10 J 

4 

4 

0 

7.13 

1.98 

1.68 

2.32 

0.2 

7.21 

2.01 

1.70 

2.35 

0.5 

7.22 

2.01 

1.70 

2.35 

1 

7.19 

1.99 

1.69 

2.33 

2 

7.15 

1.97 

1.67 

2.31 

5 

7.12 

1.96 

1.65 

2.29 

10 

7.11 

1.95 

1.65 

2.29 

20 

7.11 

1.95 

1.65 

2.23 

no 

7.10 

1.95 

1.65 

2.28 

A*" 

k^  x  10^ 

k,  x  10^ 

Shape 

Mean  x  10 

Variance  x  10 

4 

0 

5.96 

1.39 

10.89 

14.27 

0.2 

5.87 

1.31 

9.65 

11.97 

0.5 

5.69 

1.15 

7.69 

8.62 

1 

5.50 

1.02 

6.24 

6.40 

z. 

5.36 

0.939 

5.39 

5.20 

5 

5.26 

0.888 

4.91 

4.56 

10 

5.23 

0.872 

4.76 

4.37 

20 

5.21 

0.864 

4.69 

4.28 

5.19 

0.857 

4.62 

4.28 

oo 
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TABLE  5 


FIRST  FOUR  CUMULAHTS  OF  THE  STATISTICS 
CASE  2 — KNOWN  SHAPE 


W2 


,  .  .3 

Shape 

Mean  x  100 

Variance  x  102 

x  10 

k,  x  10 

4 

0 

16.66 

22.22 

8.47 

50.79 

0.2 

12.69 

11.52 

3.11 

13.28 

0.5 

10.54 

6.65 

1.29 

4.06 

1 

9.26 

4.35 

0.639 

1.54 

2 

8.44 

3.18 

0.372 

0.717 

5 

7.88 

2.53 

0.248 

0.397 

10 

7.68 

2.33 

0.213 

0.319 

20 

7.58 

2.23 

0.198 

0.284 

00 

7.48 

2.14 

0.183 

0.253 

U2 

Shane 

Mean  x  100 

Variance  x  10  2 

k^  x  10^ 

k.  x  105 
4 

0 

8.33 

2.77 

2.65 

3.97 

0.2 

7.53 

2.20 

1.90 

2.64 

0.5 

7,27 

2.04 

1.73 

2.39 

1 

7.18 

1.98 

1.68 

2.32 

2 

7.13 

1.96 

1.66 

2.30 

5 

7.11 

1.95 

1.65 

2.29 

10 

7.11 

1.95 

1.65 

2.23 

20 

7.10 

1.95 

1.65 

2.28 

00 

7.10 

1.95 

1.64 

2.28 

A2 

Shape 

Mean  x  10 

Variance  x  102 

k^  x  102 

k.  x  102 
4 

0 

10.00 

57.97 

104.32 

303.97 

0.2 

7.45 

29.21 

37.06 

76.48 

0.5 

6.51 

13.93 

18.33 

29.36 

1 

5.96 

13.92 

10.90 

14.27 

2 

5.61 

11.20 

7.45 

8.34 

5 

5.37 

9.59 

5.65 

5.61 

10 

5.28 

9.07 

5.12 

4.86 

20 

5.24 

8.82 

4.87 

4.52 

CO 

5.19 

8.57 

4.62 

4.19 
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TABLE  6. 


FIRST  FOUR  CUMOLANTS  OF  THE  STATISTICS 
CASE  3— -NEITHER  KNOWN 


Shape 

Mean  x  100 

3 

Variance  x  1CT 

k^  x  104 

k.  x  104 

4 

0 

9.26 

4.36 

6.39 

1.54 

0  .  2 

7.39 

2.40 

2.50 

0.430 

0 . 5 

6.66 

1.69 

1.37 

0.185 

1 

6.32 

1.41 

0.991 

0.115 

2 

6.13 

1.28 

0.833 

0.0892 

5 

6.02 

1.21 

0.754 

0.0772 

10 

5.98 

1.19 

0.731 

0.0738 

20 

5.96 

1.18 

0.721 

0.0722 

T- 

5.95 

1.16 

0.709 

0.0705 

U2 

Shane 

Mean  x  100 

Variance  x  10  2 

k3  x  104 

k.  >:  IQ3 

0 

7.18 

1.98 

1.68 

2.32 

0  o  2 

7.12 

1.93 

1.60 

2.15 

0.5 

5.78 

1.10 

0.647 

0.623 

1 

5.68 

1.03 

0.581 

0.531 

2 

5.63 

1.00 

0.552 

0.492 

5 

5.59 

0.987 

0.535 

0.470 

10 

5.58 

0.931 

0.529 

0.463 

20 

5.58 

0.978 

0.527 

0.459 

oo 

5.57 

0.975 

0.525 

0.455 

A2 

Shape 

Mean  x  10 

Variance  x  10 

k^  x  102 

k.  x  102 

4 

0 

5.96 

1.39 

10.89 

14.27 

0.2 

4.44 

0.614 

2.94 

2.39 

0.5 

4.11 

0.460 

1.75 

1.14 

1 

3.97 

0.406 

1.39 

0.808 

2 

3.91 

0.382 

1.25 

0.685 

5 

3.87 

0.370 

1.17 

0.625 

10 

3.86 

0.366 

1.15 

0.608 

20 

3.85 

0.364 

1.14 

0.600 

3.84 

0.362 

1.13 

0.591 
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2U .  ABSTRACT 


It  is  well  known  that  the  distributions  of  the  goodness-of -f it  statistics 
2  2  2 

W  ,  U  ,  and  A  do  not  depend  on  location  and  scale  parameters  even  when 
these  parameters  must  be  estimated.  It  has  generally  been  assumed,  however, 
that  when  shape  parameters  must  be  estimated  the  null  distributions  of  these 
statistics  would  depend  too  severely  on  the  unknown  parameters  to  permit  their 
use  in  practice.  Exact  asymptotic  distributions  of  these  statistics  are  pre¬ 
sented  for  the  two  parameter  gamma  family  of  distributions.  Critical  points 
are  given  in  tables  constructed  so  as  to  avoid  any  need  for  extrapolation. 

The  results  presented  show  that  the  tests  can  be  expected  to  be  useful  in 
practice . 
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